Speech analysis/synthesis/conversion by using sequential processing

نویسندگان

Panuthat Boonpramuk

Tetsuo Funada

Noboru Kanedera

چکیده

This paper presents a method for speech analysis/synthesis/ conversion by using sequential processing. The aims of this method are to improve the quality of synthesized speech and to convert the original speech into another speech of different characteristics. We apply the Kalman Filter for estimating the auto-regressive coefficients of ‘time varying AR model with unknown input (ARUI model)’, which we have proposed to improve the conventional AR model, and we use a band-pass filter for making ‘a guide signal’ to extract the pitch period from the residual signal. These signals are utilized to make the driving source signal in speech synthesis. We also use the guide signal for speech conversion, such as in pitch and utterance length. Moreover, we show experimentally that this method can analyze/synthesize/convert speech without causing instability by using the smoothed auto-regressive coefficients.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion

Speech synthesis has been applied in many kinds of practical applications. Currently, state-of-the-art speech synthesis uses statistical methods based on hidden Markov model (HMM). Speech synthesized by statistical methods can be considered over-smooth caused by the averaging in statistical processing. In the literature, there have been many studies attempting to solve over-smoothness in speech...

متن کامل

Codec integrated voice conversion for embedded speech synthesis

Voice conversion technologies transform individual characteristics of speech patterns while preserving the original content, and can be widely used in speech processing. Considering limited system resources, in particular, of embedded concatenative speech synthesis, voice conversion may reduce the memory consumption of the acoustic database. Voice conversion enables the intra-gender or cross-ge...

متن کامل

On glottal source shape parameter transformation using a novel deterministic and stochastic speech analysis and synthesis system

In this paper we present a flexible deterministic plus stochastic model (DSM) approach for parametric speech analysis and synthesis with high quality. The novelty of the proposed speech processing system lies in its extended means to estimate the unvoiced stochastic component and to robustly handle the transformation of the glottal excitation source. It is therefore well suited as speech system...

متن کامل

Issues in Thai Text - to - Speech Synthesis : The NECTEC Approach 1

This paper presents all the essential issues in developing the text-to-speech synthesis for Thai text analysis, prosody generation and speech synthesis. In the text analysis, problems in Thai text processing can be decomposed into the models of sentence extraction, phrase boundary determination and grapheme-to-phoneme conversion. The syllable duration and F0 contour generation rules are include...

متن کامل